First-arrival picking is an essential step in seismic data processing, which can directly affect the accuracy of normal moveout correction, static correction and velocity analysis. At present, affected by background noise and complex near-surface conditions, the picking accuracies of the existing methods are reduced. Based on this, a First-arrival automatic Picking algorithm based on Clustering and Local linear regression (FPCL) was proposed. This algorithm was implemented in two stages: pre-picking and fine-tuning. In the pre-picking stage, the k-means technique was firstly used to find first-arrival cluster. Then the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) technique was used to pick first-arrivals from the cluster. In the fine-tuning stage, the local linear regression technique was used to fill in missing values, and the energy ratio minimization technique was used to adjust error values. On two seismic datasets, compared with Improved Modified Energy Ratio (IMER) method, FPCL had the accuracy increased by 4.00 percentage points and 3.50 percentage points respectively; compared with Cross Correlation Technique (CCT), FPCL had the accuracy increased by 38.00 percentage points and 10.25 percentage points respectively; compared with Automatic time Picking for microseismic data based on a Fuzzy C-means clustering algorithm (APF), FPCL had the accuracy increased by 34.50 percentage points and 3.50 percentage points respectively; compared with First-arrival automatic Picking algorithm based on Two-stage Optimization (FPTO), FPCL had the accuracy increased by 5.50 percentage points and 16.25 percentage points respectively. The above experimental results show that FPCL is more accurate.
Concerning the problems of the difficulties in obtaining, the limitation of labels, and the high cost of labeling of shale gas reservoir data, a Multi-standard Active query Multi-label Learning (MAML) algorithm was proposed. First of all, with the consideration of the informativeness and representativeness of the samples, the preliminary processing was performed on the samples. Secondly, the sample richness constraints including attribute differences and label richness were added, on this basis, the valuable samples were selected and the labels of these samples were queried. Finally, a multi-label learning algorithm was used to predict the labels of the remaining samples. Through experiments on eleven Yahoo datasets, the MAML algorithm was compared with popular multi-label learning algorithms and active learning algorithms, and the superiority of the MAML algorithm was proved. Then, the experiments were extended to four real shale gas well logging datasets. In these experiments, compared with the multi-label learning algorithms: Multi-Label Multi-Label K-Nearest Neighbor (ML-KNN), BackPropagation for Multi-Label Learning (BP-MLL), multi-label learning with GLObal and loCAL label correlation (GLOCAL) and active learning by QUerying Informative and Representative Examples (QUIRE), the MAML algorithm improved the average prediction accuracy of comprehensive quality of shale gas reservoirs by 45 percentage points, 68 percentage points, 68 percentage points, and 51 percentage points, respectively. The practicability and superiority of the MAML algorithm in the prediction of shale gas reservoir sweet spots are fully proved by these experimental results.
Some researchers suggest replacing concept lattices with concept sets in recommendation field due to the high time complexity of concept lattice construction. However, the current studies on concept sets do not consider the role of approximate concepts. Therefore, approximate concepts were introduced into recommendation application, and a genetic algorithm based Approximate Concept Generation Algorithm (ACGA) and the corresponding recommendation scheme were proposed. Firstly, the initial concept set was generated through the heuristic method. Secondly, the crossover operator was used to obtain the approximate concepts by calculating the extension intersection set of any two concepts in the initial concept set. Thirdly, the selection operator was used to select the approximate concepts meeting the conditions according to the similarity of extensions and the relevant threshold to update the concept set, and the mutation operator was adopted to adjust the approximate concepts without meeting the conditions to meet the conditions according to the user similarity. Finally, the recommendation to the target users was performed according to the neighboring users’ preferences based on the new concept set. Experimental results show that, on four datasets commonly used by recommender systems, the approximate concepts generated by ACGA algorithm can improve the recommendation effect, especially on two movie scoring datasets, compared with Probabilistic Matrix Factorization (PMF) algorithm, ACGA algorithm has the F1-score, recall and precision increased by nearly 78%, 104% and 57% respectively; and compared with K-Nearest Neighbor (KNN) algorithm, ACGA algorithm has the precision increased by nearly 12%.
MBSA(Map-based BitSet Associaition Rule) algorithm was presented which used TreeMap class and a compressed BitSet class in Java to store Boolean values. MBSA algorithm scanned the transaction database only once and further database scans were replaced by BitSet logical AND operation, which efficiently speeded up the computation. MBSA algorithm had been applied to mine the association rules of red, green and blue bands associated with crop yield from remote sensing image of crop. It is useful for improve crop production.
A PDAS(Pattern Discovery and Analyzing System) structure for finding user access patterns was designed according to characteristics of Web information. Based on association rule theory, the processing models and algorithms of single-user k-sequence frequent access patterns were presented. Experiments show that frequent access patterns mined by algorithms can assist decision-making to some extent.